Versions:

  • 0.2.4
  • 0.2.3
  • 0.2.2
  • 0.2.1
  • 0.2.0
  • 0.1.3
  • 0.1.2
  • 0.1.1
  • 0.1.0
  • 0.0.9
  • 0.0.8
  • 0.0.7
  • 0.0.6

UI TARS is a GUI agent application developed by ByteDance that lets users control their computer through natural language commands. It uses the UI-TARS Vision-Language Model to interpret tasks visually and carry them out across the desktop environment. The program belongs to the system automation and accessibility software category: it translates spoken or typed instructions into precise on-screen actions, which makes it suitable for hands-free navigation, repetitive workflow automation, assistive technology scenarios, and rapid prototyping of UI test scripts.

By combining computer vision with a language model trained on diverse interface layouts, UI TARS can identify buttons, menus, and input fields across applications and then operate them according to the user's intent in context, effectively turning conversational prompts into cross-program macros. Early adopters range from productivity enthusiasts seeking voice-driven multitasking to quality-assurance teams that need reproducible interaction sequences without manual scripting.

Since its initial release, the project has gone through thirteen versions. The current stable release, 0.2.4, reflects ongoing refinements in recognition accuracy, latency, and language support.

The software is available for free on get.nero.com. Downloads are provided through trusted Windows package sources such as winget, which always deliver the latest version and support batch installation of multiple applications.

Tags: